Corpus-Centered Computation

نویسنده

  • Eiichiro Sumita
چکیده

To achieve translation technology that is adequate for speech-to-speech translation (S2S), this paper introduces a new attempt named Corpus-Centered Computation, (abbreviated to C and pronounced c-cube). As opposed to conventional approaches adopted by machine translation systems for written language, C places corpora at the center of the technology. For example, translation knowledge is extracted from corpora, translation quality is gauged by referring to corpora and the corpora themselves are normalized by paraphrasing or filtering. High-quality translation has been demonstrated in the domain of travel conversation, and the prospects of this approach are promising due to the benefits of synergistic effects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Multidisciplinary Student-centered Laboratory

1 Texas A&M University -Corpus Christi Department of Computing and Mathematical Sciences, [email protected], [email protected], [email protected], [email protected] Abstract  In this paper we describe a student-centered laboratory developed by the Department of Computing and Mathematical Sciences at Texas A&M University – Corpus Christi and partially supported...

متن کامل

A corpus-centered approach to spoken language translation

This paper reports the latest performance of components and features of a project named CorpusCentered Computation (C'3), which targets a translation technology suitable for spoken language translation. C3 places corpora at the center of the technology. Translation knowledge is extracted from corpora by both EBMT and SMT methods, translation quality is gauged by referring to corpora, the best t...

متن کامل

EBMT, SMT, hybrid and more: ATR spoken language translation system

This paper introduces ATR’s project named Corpus-Centered Computation (C3), which aims at developing a translation technology suitable for spoken language translation. C3 places corpora at the center of its technology. Translation knowledge is extracted from corpora, translation quality is gauged by referring to corpora, the best translation among multiple-engine outputs is selected based on co...

متن کامل

Human-Centered Analysis and Visualization Tools for the Blogosphere

Blogging has become a new and disruptive communication medium. Blogs have changed the way people and organizations express, interact, and—quite unforeseen—exercise influence. The digital nature of the blog media provides access to an always-expanding corpus of information. It would take more than a lifetime to read all the available blogs necessary to answer questions such as what were the more...

متن کامل

Modeling Narrative-Centered Tutorial Decision Making in Guided Discovery Learning

Interactive narrative-centered learning environments offer significant potential for scaffolding guided discovery learning in rich virtual storyworlds while creating engaging and pedagogically effective experiences. Within these environments students actively participate in problem-solving activities. A significant challenge posed by narrative-centered learning environments is devising accurate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002